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Abstract 

This paper studies a simple class of zero-sum games played by two competing quantum 
players: each player sends a mixed quantum state to a referee, who performs a joint measure- 
ment on the two states to determine the players' payoffs. We prove that an equilibrium point of 
any such game can be approximated by means of an efficient parallel algorithm, which implies 
that one-turn quantum refereed games, wherein the referee is specified by a quantum circuit, 
can be simulated in pol3momial space. 

1 Introduction 

The theory of games has been studied extensively in mathematics and in several other disciplines 
for which it has applications. In theoretical computer science, computational aspects of game 
theory represent an important focus of the field. 

There are several settings of interest to quantum computation and quantum cryptography 
that are naturally modeled by quantum games, which are games involving the exchange and pro- 
cessing of quant um information. For instance, multi-prover quantum interactive proofs IIKM031 
ICHTW041IkKM+ 08. KK MVOSH can be modeled as cooperative quantum games; quantum coin- 
flipping IIAmb01liKit02tiSR02llMoc04llMoc07ll is naturally modeled as a game between two players 
that directly exchange quantum information; and quantum refereed games IGutOSl IGW051 |GW07[ 
are competitive games that model quantum interactive proofs with competing provers. 

In this paper we consider a simple type of non-interactive, zero-sum quantum game: two 
competing players (hereafter called Alice and Boh) each send a mixed quantum state to a referee, 
who performs a joint measurement on the two states to determine the players' payoffs. For a fixed 
description of the referee, let (p{p, a) denote Alice's expected payoff when she sends a mixed state 
p to the referee and Bob sends a mixed state a. (For zero-sum games, Bob's payoff is then given 
by —(p{p, cr).) The theory of quantum information requires the function (p{p, a) to be bilinear, from 
which it follows that 

maxmin(*(jO, cj) = minmax(*(p, cr) (1) 

pa a p 

from well-known variants of the Min-Max Theorem. (Indeed, such a fact holds for a much more 
general class of quantum zero-sum games that can allow for many rounds of interaction among 
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the referee and players IIGW07I .) The value represented by the two sides of the equation ([T]) is 
called the value of the game. An equilibrium point of such a game is a pair of quantum states {p, a) 
such that 

max^(|0', (j) = <p{p,cr) = min^(p, cr'), 

p' a' 

the existence of which follows from the equation (|T]). In other words, when one player plays 
one of the states of an equilibrium point, the other has no incentive to play a state different from 
the other state in the pair. (These notions are, of course, similar to those for classical zero-sum 
games, but with some technical differences due to the nature of quantum information. In par- 
ticular, there is a continuum of pure strategies for quantum players, corresponding to what are 
known as pure quantum states.) An equilibrium point of a zero-sum quantum game, given as 
and explicit description of the referee's measurement, can be efficiently computed by means of 
semidefinite programming. 

The main result of this paper is an efficient parallel algorithm to find approximate equilibrium 
points of non-interactive zero-sum quantum games. For the case where the referee is specified by 
a quantum circuit rather than in explicit matrix form, this algorithm implies that the value of such 
a game can be approximated in polynomial space. More succinctly, it implies that the complexity 
class QRG(l) of problems having one-turn quantum refereed games is contained in PSPACE. 

Our algorithm is an example of the multiplicative weights update method, which is discussed 
in the papers |AHK05bl ,TRW05J, for instance, and is explained in detail in the PhD thesis of 
S. Kale |Kal07L This general method captures many previously discovered (and sometimes re- 
discovered) algorithms, and has origins in learning theory, game theory, and optimization. The 
specific formulation of our algorithm is a non-commutative extension of an (unpublished) algo- 
rithm of Rohit Khandekar and the first author (Rahul Jain) that approximates equilibrium points 
of classical games. 

In the sections that follow, we give relevant definitions from the theory of quantum informa- 
tion, present the algorithm and its analysis, and discuss the containment QRG(l) C PSPACE that 
follows. We also explain how the problem of finding equilibrium points of quantum games relates 
to the problem of approximating positive instances of semidefinite programs. 

2 Preliminaries and definitions 

This section gives a brief summary of the quantum information-theoretic concepts that are needed 
in the paper, and then defines non-interactive zero-sum quantum games. A few additional defini- 
tions that will be helpful later in the paper are also discussed. 

2.1 Basic quantum information-theoretic notions 

In this paper we require just a few basic concepts about quantum information; so it is not neces- 
sarily required that the reader has any prior familiarity with it. 

When we refer to a quantum register we simply mean a discrete quantum system that we wish 
to consider, such as a collection of qubits representing a message transmitted from one party to 
another. With any quantum register we associate some vector space X = C" for a positive integer 
n that intuitively represents the maximum number of distinct classical states that could be stored 
in the register without error. A state of such a register is represented by a density matrix, which is 
an X n positive semidefinite matrix having trace equal to 1. Density matrices may reasonably be 
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viewed as the quantum information-theoretic analogue to a vector of probabilities, representing a 
probability distribution. We will write D (A") to denote the set of all density matrices associated 
with a register that is described by Af. It is natural to view such density matrices as linear operators 
acting on X, and for this reason the term density operator is commonly used in place of density 
matrix. 

When two registers having associated spaces X = C" and y = C" are considered as a single 
compound register, the associated space becomes the tensor product space X ®y = C"^. If the 
two registers are independently prepared in states described by p and a, respectively, then the 
joint state is described by the nm x nm density matrix p®a. This matrix may be written in block 
form as 

(Pl,lO- ■ ■ ■ pi,nO-\ 
: • 
Pn.lO' ■ ■ ■ Pn,n(^J 

In general, for a vector space X = C", we write 1^{X) to denote the set of all n x n complex 
matrices, or linear operators mapping X to itself, and we write Herm [X) to refer to the subset 
of L [X) given by the Hermitian matrices. These are the matrices A satisfying A — A*, where A* 
denotes the adjoint or conjugate transpose of A. The set Herm [X) forms a vector space over R, and 
many optimization methods designed for real-valued symmetric matrices extend to Herm (X) 
with little or no special consideration. Finally, we write Pos (X) to denote the subset of Herm (X) 
that consists of all positive semidefinite n x n matrices (or operators acting on X). 

The Hilbert-Schmidt inner product onL{X) is defined as 

{A,B) = Tr(A*B) 

for all A, B G L[X). It holds that (A, B) is a real number for all choices of Hermitian matrices A 
and B, and is a nonnegative real number for all choices of positive semidefinite matrices A and B. 

A measurement of a register, having an associated vector space X = C", is described by a 
collection of n x n positive semidefinite matrices that sum to the identity. Specifically, a mea- 
surement that has some finite, non-empty set E of possible outcomes is described by a collection 
{Pa : fl G E} C Pos {X) satisfying 

AGE 

Here, 1 ;f denotes the n x n identity matrix, or identity operator on X. (The subscript X is dropped 
when it is implicitly clear.) If the register corresponding to A" is in a state described by the density 
matrix p e D{X), and this measurement described by {?« : a e Z} C Pos (X) is performed, each 
outcome a e E will be observed with probability (Pa, p) . 

2.2 Non-interactive zero-sum quantum games 

In a non-interactive zero-sum quantum game, Alice and Bob each send a quantum state to a ref- 
eree, who performs a measurement on these two states to determine their payoffs. Hereafter we 
will let .4 = C" and B = C" refer to the vector spaces corresponding to the states that Alice and 
Bob send to the referee. 

When the referee performs a measurement to determine Alice and Bob's payoffs, a joint mea- 
surement is used. In other words, Alice's and Bob's states are together viewed as a single state of 
a register. We therefore have that the referee's measurement is described by a collection 

{Ra : fl e E} c Pos {A ® B) 
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that satisfies the condition 

If Alice sends the state p and Bob sends the state a, then each possible measurement outcome 
fl S Z appears with probability {Ra, p®a). 

A payoff for each player is associated with each possible measurement outcome a G E. As we 
consider only zero-sum games, it is sufficient to describe these payoffs by a function : Z ^ R; 
with Alice's payoff for outcome a being v{a) and Bob's payoff being —v{a). For a given choice of 
states p and a, it holds that Alice's expected payoff is given by 

v{a) {Ra,p®a) = {R,p®a) 

for 

Bob's expected payoff is given by — (R,p®a). When one is interested only in the expected payoff 
of a given game, it is therefore sufficient to consider that the game is simply determined by R. We 
will refer to R as a payoff observable, given that a matrix that arises in this way from a measurement 
and a real-valued function on its outcomes is sometimes called an observable. 

A necessary and sufficient condition for a matrix R acting on A(>^ B to arise from some mea- 
surement and real-valued payoff function v as just described is that R is Hermitian. The sort of 
payoff function (p{p, a) discussed in the introduction therefore takes the form (p{p, a) = (R,p ® a) 
for R ranging over the set of Hermitian matrices of the appropriate size. As the tensor product is 
a universal bilinear function, and every real- valued linear function on Herm {A) ® Herm [B) can 
be expressed as an inner product with some Hermitian matrix R, we have that a necessary and 
sufficient condition for (p{p,a) to be a physically valid payoff function is that ^ is a real-valued 
bilinear function. 

Now, given that the sets D {A) and D [B) are convex and compact, and that Alice's expected 
payoff {R,p ® o') is a bilinear function on D [A) x D [B), it follows from well-known extensions 
of von Neumann's Min-Max Theorem |vN28i that 

max min IR, p ® a) = min max (R, p®a). (2) 

peD(^) (rGD(i?) £reD(i?) pGD(^) 

(See, for instance, | jFan53 |.) We define *(R) to be the value of the game determined by R, which 
is the quantity represented by the two sides of the above equation Q. A pair of quantum states 
(p, a) is called an equilibrium point for R if both p and independently achieve the maximum and 
minimum, respectively, in equation |(2]); or, equivalently, that 

min (R,p t7-'\ = (R,p (g) cj) = max (R,p'cg)£7\. 

Again, the existence of an equilibrium point follows easily from equation (|2]). 

We define that an e-approximate equilibrium point of a game with payoff observable R is a pair 
of states [p, a) such that 

max (R,p' ®a) - t\\R\\ < {R,p®a) < min (R,p ct"') + e 11 R 11 . 

Note that this is an approximation in an additive sense, and is relative to the maximum absolute 
value of any payoff (which is reflected by the presence of the factor || R || in the error). 
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2.3 Additional definitions and notation 



This section summarizes some additional terminology and notation that will be used in the paper. 
First, a linear mapping of the form ^ : L (B) L {A) is called a super-operator (as it maps linear 
operators to linear operators). The adjoint super-operator to O has the form O* : L (A) L (B), 
and is uniquely determined by the condition 

(A,0(B)) = 

for all A G L (^) and B G L(^)- A super-operator O : L ^{A) is said to be positive 

if it holds that 0(P) is positive semidefinite for every choice of a positive semidefinite operator 
P G Pos [B). It is the case that O* is positive if and only if O is positive. 

There is a one-to-one and onto linear correspondence between the collection of operators of 
the form R G L[A^ B) and the collection of super-operators of the form O : L (B) L (A), 
which is sometimes known as the Choi-Jamiotkoivski isomorphism. Specifically, for every super- 
operator O : L [B) L [A), one defines an operator 2? G L B), called the Choi-Jamiolkowski 
representation of O, by the equation 

l<i,j<tn 

where Ej-y is the matrix with a 1 in entry (/,/) and in every other entry. Conversely, given an 
operator R gL{A® B), one defines a super-operator O : L [B] L [A) by means of the formula 

0(B) =Tre(R(1^0B")). (3) 

It follows that 

Tr (R(A0B)) = Tr(AO(B^)) 

for every choice of A G L [A) and B G L [B). These correspondences are both linear, and are 
inverse to one another — so one is free to translate back and forth between the two as necessary for 
a given application. The assumption that R is positive semidefinite implies that the corresponding 
super-operator O is positive. (In fact, O has the stronger property of being completely positive if and 
only if R is positive semidefinite.) 

For a given quantum game, we may equally well calculate expected payoffs and equilibrium 
points by using the unique super-operator ^ determined by (jS)) rather than the payoff observable 
R. In particular, {p, a^) is an equilibrium point of the game defining R if and only if 

min (|0,O(t7')> = (|0,O((7)) = max {p' ,^{a)) , (4) 

and the value of this game is alternately expressed as 

a(0) = max min (p, 0(cr)) = min max (p,0((7)). (5) 

;OGD(^) £7GD(S) £rGD(S)pGD(^) 

For a Hermitian n x n matrix A, one denotes the eigenvalues of A by 

Ai(A) > A2(A) > •■■ > A„(A), 

sorted from largest to smallest and including each eigenvalue a number of times equal to its mul- 
tiplicity. For every n x n Hermitian matrix A, the spectral norm is denoted || A || and satisfies 

||A|| =max{|Ai(A)|,...,|A„(A)|}, 
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while the trace norm is denoted || A ||^ and satisfies 

\\A\\^ = |Ai(A)| + --- + |A„(A)|. 

(Note that both of these formulas assume that A is Hermitian.) 

Using the above notation, we may express the equations (|4]) and (O in simpler terms: (p, c""") is 
an equilibrium point of the game defining O if and only if 

Ai(<I>(£7)) = (p,0(^))=A,„(0*(p)), 

while the value of this game satisfies 

= min Ai(0(t7)) = max A^(^*(p)). 

aGD{B) |OGD(yt) 

Finally, for future reference we note that if a payoff observable R satisfies < R < 1, then it 
holds that < 0(cr) < 1 and < < 1 for all choices of density matrices p G D (A) and 

(7 G I){B). Moreover, for arbitrary Hermitian matrices A G Herm (^) and B G Herm(i3), we 
have \\^{B)\\ < \\B\\^ and ||<I>*(A)|| < \\A\\^. 



3 The Main Result 

We now present the main result of the paper, which is a parallel algorithm to approximate the 
value of a non-interactive zero-sum quantum game. This fact is stated as Theorem [T] below, fol- 
lowing a few comments on the assumed form of the input. 

We suppose that a given non-interactive zero-sum quantum game is described by payoff ob- 
servable R G Herm [A ® B), for A = and B = C"' as discussed in the previous section. More 
precisely, we assume that R is given as an nm x nm matrix, along with a specification of the di- 
mensions n and m. Each entry of R is a complex number, which we assume has rational real and 
imaginary parts, each represented as the ratio of two integers expressed in binary notation. We let 
k be the maximum length of the binary representation over all of these integers, and define size(R) 
to be {nm)^k. It is clear that 0(size(R)) bits suffice to encode R. 

In addition to R, n and m, an accuracy parameter £ > is also given as input. For technical 
reasons it is most convenient to assume that e is represented in unary notation: each string 1'', for 
a positive integer r, denotes the value e = 1/r. This assumption on the input form of e reflects 
the fact that our algorithm does not scale well with respect to accuracy — it forces the length of the 
input to be proportional to 1/e rather than log(l/£), and therefore permits our algorithm to be 
described by circuits with size polynomial in the input length. 

The output of the algorithm will be a pair of density matrices (p, c^) where p G D (^) and 
G D [B). They are assumed to be represented in a manner similar to the input matrix R. 

Theorem 1. An e-approximate equilibrium point {p, a) for a given payojf observable R can be computed by 
a logarithmic-space uniform family of Boolean circuits having depth polynomial in log(size(R)) and 1/e. 

3.1 Parallel algorithm for positive games 

Our algorithm is most naturally described for the case that the payoff observable R satisfies 
< R < 1. We therefore begin with this case, which will imply Theorem [T] by an appropriate 
translation and rescaling of R. The algorithm is described in Figured! 
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Algorithm 

1. Let = e/8 and let N = |'641n(nm)/£2] . 

2. Initialize: Aq = 1a, Po = ^o/ Tr(Ao), Bq = Is, and t7o = Bo/ Tr(Bo)- 

3. For each ; from 1 to N, let Aj, pj, Bj, and tJy be as follows: 

Aj = exp (^^Y^^{ai)^ , 
py = A^/Tr(Ay), 

^; = B;/Tr(By). 

4. Output the pair (p, (f^, where 

P = TT Pi and = TT c^i- 

Figure 1: A parallel algorithm for obtaining an e-approximate equilibrium point of a one-round 
zero-sum quantum game. The game is assumed to be described by a payoff observable K satisfy- 
ing < R < 1, which gives rise to a positive map O : L (JS) L (^) as described in Section l23l 



3.1.1 Accuracy of the algorithm 

In this section, the accuracy of the algorithm described in Figure [1] is analyzed. We note that a 
similar type of analysis has appeared in previous works on the multiplicative weights update 
method and its predecessors, and in particular the reader is referred to |Kal07| for information on 
the generality of the approach. 

At this point in the analysis we are concerned only with the idealized algorithm described in 
Figured] — numerical issues concerning the required precision with which the idealized operations 
are performed are discussed in the next subsection. We begin by noting some facts concerning 
matrix exponentials. First, the Golden-Thompson Inequality (see Section IX.3 of [Bha97| ) states that, 
for any two Hermitian matrices X and Y of equal dimension, we have 

Tr < Tr (e^e^) . 

Second is the following simple pair of inequalities concerning the matrix exponential of positive 
and negative semidefinite matrices. 

Lemma 2. Let P be an operator satisfying < P < 1. Then for every real number ji > 0, the following 
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two inequalities hold: 

exp{}iP) < 1 + jiexp{}i)P, 
exp(— /.(P) <1 — jiexp{—}i)P. 

Proof. It is sufficient to prove the inequalities for P replaced by a scalar A G [0, 1], for then the op- 
erator inequalities follow by considering a spectral decomposition of P. If A = both inequalities 
are immediate, so let us assume A > 0. By the Mean Value Theorem there exists a value Aq G (0, A) 
such that 

exp(ffA) — 1 ^ ^gxp(f/Ao) < piexpiji), 
from which the first inequality follows. Similarly, there exists a value Aq G (0, A) such that 

exp(-/^A) - 1 r y \ ^ r \ 

— J = -jiexpi-fiAo) < -iiexp{-}i), 

which yields the second inequality. □ 

We now proceed to the main part of the accuracy analysis, which comprises two bounds on 
the eigenvalues of O* (p) and 0((7), where {p, cr^) is the output of the algorithm. 



Lemma 3. The following inequalities hold: 

A.W.))<5^EM|(p,_„o(.,._,)> + i^, (6) 

A,,(*-(p))>53^|(p,,_,,^(,,_,,)_!=M. (,) 

Proof. Let us begin by noting that each of the operators Aj and Bj (for ; = 0, . . . , N) that are 
obtained during the course of the algorithm are positive definite, and therefore have positive trace. 
It follows that each of the operators pj and aj is a well-defined density operator. 
To prove the first inequality, observe that 

An = exp (^^'^^^{(Tj-l)^ = exp{^N^{(7)) . 

Given that A^ is positive definite, it holds that 

Tr(AN) > Ai(A„) = exp(/^NAi(0((7))), 

and therefore 

A,(0(.)) < (8) 

The inequality (HI) will now follow by bounding ln(Tr(X]v/)), which can be done as follows. 
First, note that we may alternately write 

Aj = exp (ln(Ay_i) + /^0(t7y_i)) 
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for each / > 1, and therefore 

Tr(Ay) =Tr(exp(ln(A^_i)+/^0((7y_i))) < Tr exp 0((7y_i))) 

by the Golden-Thompson inequality. As dj^i is a density operator, it holds that ^{aj^i) < 1, and 
therefore 

by Lemma|2l Thus, using the fact that Tr(Xyi) < Tr(Xy2) for all choices of matrices X, Yi, and Y2 
with X > and Yi < Yz, we have 

Tr(A^) < Tr {1 + iiexp{fi)^{<7j^,))) = Tr(A^_i) {l + ^exp{^) (py_i,0(^7y_i)» . 

It now follows from the inequality 1 + f/ < exp{}i) that 

Tr(A^) < Tr(A^_i)exp {^exp{^) (p^_i,0((7^_i)» . 

Applying this inequality recursively, and using the fact that Tr(Ao) = n, we obtain 

Tt{A^) < exp (^^exp{^)p^ (py_i,0(c7-y_i)> + ln(n)^ . (9) 
Combining ^ and (|9]) yields 

A,(*W)<52^g(ft-,.*(<^,-,)) + i^, 

as required. 

The second inequality follows by similar reasoning, except with a few differences that we now 
highlight. We first observe that 

BN = exp(-^NO*(p)). 

This time we have 

Tr(BN) > M{Bn) = exp(-^NA„,(0*(p))), 

where the switch from the largest eigenvalue to the smallest is caused by the minus sign in the 
exponential function. Thus, 

A„(*-W) > (10) 

The quantity Tr(B]v) is now bounded in the same way as Tr(A]v), except that we need the second 
inequality in Lemma |2l Specifically, the Golden-Thompson inequality implies 

Tr(By) =Tr(exp(ln(By_i)-HO*(py_i))) <Tr(B;-iexp(-^(0*(p^_i))). 

As <I>*(|Oy_i) < 1 we have 

exp(-/^0*(py_i)) < 11 -iiexp{-^)0*{pj_,), 

and therefore 

Tr(By) < Tr(B,_i)exp(-;^exp(-;^) ((7)_i,0*(py_i)». 
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It follows that 

Tr(BN) < exp ^-;,exp(-;^) £ (a)_i,0*(py_i)> +ln(m)^ . (11) 

Combining dlOl ) and fTT] ), along with the fact that ((7^,0* (py)) = (pj,<I>(c7,)) for every choice of;', 
yields 

and completes the proof. □ 
It is now possible to verify that the output {p,cr^) of the algorithm satisfies 

max (R,p' ^a'^) - £/2< (R,p®cr'^) < min (R, p a') + e/2, 

which is expressed in terms of the mapping <I> as 

Ai(0((7))-£/2< (p,0(^7)) <\,n{^\p))+e/2. 
It follow from Lemma |3] that 

and given that each of the quantities (<7}-i, ^{pj-i)) is at most 1, we have 

, \\ , /^*/ NX ^ . -1 / N In(nm) ^ ]n(nm) ^ _ 

jiN jiN 

Thus, given that A^(O*(j0)) < {p,^{cr)) < Ai(0(cr)), we have 

Ai(0((7))-£/2< A,„(0*(p)) < (p,0((7)) < Ai(0*(p)) < A,(0(c7))+e/2 (12) 

as claimed. 

3.1.2 Numerical precision and complexity of the algorithm 

Let us now consider the complexity of the algorithm described in Figure [T] It is the goal of this 
section to demonstrate that this algorithm can be implemented, by a logarithmic-space uniform 
family of Boolean circuits with depth polynomial in log(size(K)) + 1/e, with sufficient accuracy 
to obtain an e-approximate equilibrium point for the input payoff observable R. Throughout the 
analysis, we (sometimes grossly) overestimate errors for the sake of simpler expressions involving 
as few variables as possible. 

Each iteration performed in step 3 of the algorithm requires the evaluation of O and O*, two 
matrix exponential computations, and a constant number of elementary matrix operations (in this 
case: addition, scalar multiplication, and computation of the trace). Were it not for the matrix 
exponentials, it would be straightforward to perform all of the required operations within the 
claimed size and depth bounds using exact computations. Given that the matrix exponentials 
will generate irrational numbers, however, we must settle for approximations over the course of 
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the algorithm. To guarantee that the algorithm is sufficiently accurate, it will suffice to perform 
all computations to within an additive error of (e/2) exp(— 8N^), as is shown below. (We could 
afford to take a much smaller error with respect to size(R), but there is no need to do this.) 

Let us begin by making a few simple observations about the matrices computed throughout 
the course of the algorithm. The matrices do, ■ ■ ■ , c^n-i are density matrices, and therefore it holds 
that II <I>(t7"j) II < 1 for each choice of / = 0, . . . , N — 1. Likewise, || O* {pj) || < 1 for each choice of 
/ = 0, . . . , N — 1. Consequently, we have \\Aj\\ < , || By || < , 



1 < Tr(Ay) < e 



,2N 



and 



e-^ < Tr(By) < e 



N 



for; = 0,...,N- 1. 

Next, let us represent the actual matrices computed during the course of the algorithm by 
placing a tilde over the variables representing the idealized values that are expressed in Figure [H 
It will suffice to prove that ||p — < e/2 and ||c7" — C7||-^ < e/2, for then the inequalities 



|Ai(<D(t7))-Ai(^(5^))| < - and 



|A,(0*(p))-A, (O* (^5))|<- 



hold. Combined with ((12)) , we obtain the required accuracy. 

Now, each iteration of step 3 of the algorithm will introduce some error into the calculation 
of the final answer. Let us consider the 7-th iteration, and assume that a positive real number 
Sj G (0, 1) is given such that \\(ri — ai ||^ < 5j for i = 0, . . . ,j — 1. Let us define 

;-i ^ ;-i 

Xj = ^Y^ ^{ai) and Xj = ^J2^ i^i) ■ 

!=0 i=Q 



Then 



Xj - Xj 



< SjN and || || < N, and therefore 
exp(Xy) - exp (Xj 



< 



Xj - Xj 



where the first inequality follows from Corollary 6.2.32 of | |HJ91[ . By computing the matrix expo- 
nential with accuracy Sj we therefore have 
follows that 



Aj - Aj 



< Sje'^'^, and thus 



Aj - Aj 



5N 



It 



\Pi-P'\\^^TF(Aj 



Aj - Aj 



A, 



MAj) Tr (a 



< Sje'^. 



By similar reasoning, if it holds that Hp; — |0; ||i < ^j for f = 0, — 1, then 



Tr(B;) 



Bj - Bj 



+ 



Tr{Bj) 



Tr B 



We conclude from these bounds that taking 3j = (e/2)e guarantees that |||0 — |0||i < e/2 and 
Ik-S'lli < e/2. 

The required precision for the matrix exponentials is easily obtained by taking sufficiently 
many terms in the series = l + X + X'^/2 + X^/6 + -- -. (This of course is not the most efficient 
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way to compute matrix exponentials, but it suffices to prove the main theorem.) For instance, 
taking 9N^ terms guarantees that the required accuracy {e/2)e~^^^ is achieved. 

At this point, the parallel complexity of the algorithm is easily bounded. Each of the matrices 
stored by the algorithm has entries whose real and imaginary parts are represented in binary 
notation using O(N^) bits. For each iteration in step 3 of the algorithm, the evaluations of O and 
O*, as well as the elementary matrix operations, may therefore be performed by standard parallel 
algorithms (see, for instance, fGat93J) by logarithmic-space uniform Boolean circuits (with size 
that is necessarily polynomial in size(R) and 1/e given this uniformity constraint), within depth 
that is polynomial in log(size(R) ) and 1/e. The number of iterations performed is N, which results 
in total depth polynomial in log(size(K) ) and 1/e. 



3.2 Extensions to arbitrary payoff observables 

For an arbitrary payoff observable R, the algorithm from the previous section is not guaranteed to 
function correctly, as we have used the positivity of the corresponding super-operator O several 
times during the analysis. 

It is straightforward, however, to translate and scale an arbitrary payoff observable in a way 
that allows the algorithm to be used. For an arbitrary positive semidefinite payoff observable R, 
this is essentially trivial — one simply runs the algorithm on the payoff observable P = R/ ||R||. 
For a negative semidefinite payoff observable R, one simply exchanges the roles of Alice and Bob 
and considers the payoff observable — R (with the spaces A and 13 swapped). 

Let us now consider the general case of a payoff observable R for which Ai (R) > > A„,„ (R). 
Define 

R-A„„,(R)1 
Ai(R)-A„„,(R)- 

Then < P < 1, and so the algorithm from the previous section may be used to obtain an e- 
approximation {p, a) for P. The point {p, a) is easily verified to be a ^-approximate equilibrium 
point for R, where 

Ai(R)-A„^(R) ^^^^_ 



4 QRG(l) is contained in PSPACE 

Quantum interactive proof systems with two competing provers are naturally represented as 
games between two competing players, moderated by a referee. The two players (Alice and Bob) 
play the roles of competing provers, while the referee corresponds to the verifier. Quantum refer- 
eed games have been studied in IGWOS I iGutOS I |GW07| . and represent a quantum analogue to the 
classical refereed games model studied in llFK97ll . 

The simplest form of a refereed quantum game has the general form defined in Section |2j 
meaning that there is no communication from the referee to the players. The players each send 
a quantum state and the referee measures to determine the winner. With this picture in mind, 
one defines the complexity class QRG(l) to be the class consisting of all promise problems A = 
(Ayes, ^no) for which there exists a polynomial-time uniform family Q = {Qn '■ n ^ N} of quan- 
tum circuits, where each circuit Q„ takes n + 2p(n) input qubits for some polynomial bounded 
function p, such that the following properties hold: 
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• For every string x € Ayes it holds that 

2 

maxmmPr[Q(x,p,cr) = !]>-. 

• For every string x G Ano it holds that 

1 

maxmuiPr[Q(x,p,cr) =!]<-. 

p 0- 3 

Here, the maximum and minimum are both over all quantum states on p(|x|) qubits, and the 
notation Q{x,p,cr) = 1 is shorthand for the event that a measurement of some fixed output qubit 
of the circuit Q|^| (with respect to the standard basis) yields 1, assuming that the input to the circuit 
is the state |x) (x| (8) p (7. The name QRG(l) refers to the fact that these are quantum refereed 
games with 1 turn, during which the players send quantum states to the referee in parallel. 

The class QRG(l) may be viewed as a simple variant of QMA, where there are two competing 
provers rather than a single prover. It is obvious that QMA C QRG(l), and that QRG(l) is closed 
under complementation (and thus co-QMA C QRG(l)). 

The class QRG(l) may roughly be thought of as a quantum analogue to the class that was 
defined by |Can 96| and [RS98J, and it is easily observed that Sf C QRG(l). There is one subtlety, 
however, which is that the definition of QRG(l) does not allow one prover to see the other's 
message (which would not make sense in the quantum setting anyway), whereas the standard 
definition of S2 does. 

Proposition 4. QRG(l) C PSPACE. 

Proof. Suppose that A is a promise problem contained in QRG(l), and that {Qn} is a polynomial- 
time uniform family of quantum circuits that witnesses this fact. For each input x, let Rx denote 
the payoff observable that corresponds to the game played by the players Alice and Bob on input 
X, where the payoff for Alice is defined as 1 for acceptance and for rejection. The expected payoff 
is therefore the probability of acceptance, which Alice tries to maximize and Bob tries to minimize. 

We denote by NC (po/y) the class of promise problems computed by polynomial-space uniform 
Boolean circuits with polynomial depth. It holds that NC (po/y) C PSPACE [Bor77J, so it therefore 
suffices to prove that A € NC (po/y). This is easily accomplished by composing three families of 
Boolean circuits: 

1. A family of Boolean circuits that outputs a description of the payoff observable Rx associated 
with the game on input x. 

2. The family of Boolean circuits given by Theorem [H that finds an e-approximate equilibrium 
point (p, a) of the payoff observable R^, for e = 1/8. 

3. A family of Boolean circuits that computes the expected payoff {Rx,p a), and accepts if the 
value is greater than 1/2 (rejecting otherwise). 

The first family is easily derived from the circuits {Qn }, by computing the product of a polyno- 
mial number of exponential-size matrices that correspond to the quantum gates of the appropriate 
circuit Qn- This family may be taken to be polynomial-space uniform, with polynomial depth. The 
second family is, as suggested above, given by Theorem [TJ This family is logarithmic-space uni- 
form and has polynomial-size and poly-logarithmic depth with respect to size(Rj). Thus, with re- 
spect to the input length \x\, this family is polynomial-space uniform, and has polynomial depth. 
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The last family is required only to perform elementary matrix and arithmetic operations, and can 
be taken to have similar properties as the first two: polynomial-space uniformity and polynomial 
depth. Composing these families appropriately demonstrates that A G NC{poly) as required. □ 



5 Parallel Approximation of Positive Semidefinite Programs 

We now discuss the connection between equilibrium points of non-interactive zero-sum quantum 
games and semidefinite programs. The main focus of this section will, in particular, be on positive 
instances of semidefinite programs, and on the question of whether good parallel methods to ap- 
proximate them exist. We will first discuss the general notion of positive instances of semidefinite 
programs and then explain how our algorithm may be used in their approximation, albeit with 
poor accuracy in some cases. 

The multiplicative weights update method has been applied to semidefinite programming in 
IIAHKOSal IAK07| , and the general connection between equilibrium points of different types of 
games and linear /semidefinite programs is well-known. Once again, the reader is referred to 
Kale llKal07|l for further details and historical remarks. 

5.1 Positive instances of semidefinite programs in super-operator form 

Suppose that the following input has been given: 

1. a Hermitian matrix A G C"^", 

2. a Hermitian matrix B eC"^"^, and 

3. a linear mapping O : C'"^'" —>^ (2"^" (i.e., a super-operator) that preserves Hermiticity. 

To say that O preserves Hermiticity means that 0(y) is Hermitian for every choice of a Hermitian 
matrix Y G C'"^'". This condition is equivalent to the Choi-Jamiotkowski representation R of O 
being a Hermitian matrix. 

Given this input, let us consider the following semidefinite programming problem, which we 
say is in the super-operator form: 

Super-operator primal form Super-operator dual form 

maximize: (B/^) minimize: (^/X) 

subject to: 0(y) < A, subject to: 0*(X) > B, 
y > 0. X > 0. 

Here, X and Y range over all (positive semidefinite) matrices in C"^" and C""^"", respectively. This 
form is completely general: it is possible to translate semidefinite programs in so-called standard 
form to the super-operator form, and vice versa. It can be shown that strong duality holds for 
semidefinite programs in the super-operator form under conditions that are similar to those for 
standard form semidefinite programs. In particular, the existence of either of the following implies 
that strong duality holds: 

1. a positive definite matrix Y for which 0(y) < A, or 

2. a positive definite matrix X for which O* (X) > B. 
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We define that such a problem instance is positive if A and B are positive semidefinite matrices 
and O is a positive super-operator. Let us also define that such a problem is strictly positive if it 
holds that A and B are positive definite and ^ is a strictly positive super-operator (which means 
that 0(1) is positive definite). Strong duality necessarily holds for all strictly positive semidefinite 
programs in the super-operator form. 

5.2 Parallel approximation of strictly positive semidefinite programs 

Parallel algorithms for approximately solving positive linear programs have been given by Luby 
and Nisan [LN93J and Young IIYou01| . To our knowledge, an analogous problem for semidefinite 
programs has not been considered. 

The algorithm from Section 13.11 can be used to approximate, in parallel, strictly positive in- 
stances of semidefinite programs as we now explain. First, let us note that an arbitrary strictly 
positive semidefinite program in the super-operator form can be transformed into one of the fol- 
lowing simpler form: 

Primal Dual 
maximize: Tr(y) minimize: Tr(X) 

subject to: 0(y) < 1, subject to: 0*(X) > 1, 

y > 0. X > 0. 

This may be done by defining 

o(y) = Y (b-'2yb-^) a-t. 



for a given problem instance defined by A > 0, B > 0, and a strictly positive super-operator Y. 

Now, to make the connection with the algorithm from the previous section clear, let us recall 
that we define ^ = C" and B = C"', and suppose that the super-operator O that represents the 
above semidefinite program takes the form ^ : L (S) L {A). Let us also define opt(^) to be the 
optimal value of the primal problem (which is the same as the optimal value of the dual problem 
by strong duality). It is clear that opt(O) > 0, for some positive scalar multiple of the identity 
must be primal feasible. Let us also recall that we have defined 

a(0) = max min (|0,O((7-)). 

|DGD(^) (7GD(i3) 

Proposition 5. For all strictly positive super-operators <I> we have a(<I>) = 1/ opt(O). 
Vrooj. Let (p, c) be an equilibrium point of O, meaning that 

Ai(0((7)) = (p,0(^7)) = A„(0*(p)) = a(0). 

The assumption that O is strictly positive implies that a(0) is positive. 

We now observe that cr/a(0) is primal feasible, as it is positive semidefinite and satisfies 

which implies 0(cr/a(0)) < 1^. Likewise, |0/ft;(O) is dual feasible as it is positive semidefinite 
and satisfies 

A„,(^*(p/a(0))) =1, 

which implies ^*(|0/a(O)) > Ig. Both result in the same objective value l/a(^), and so the 
proposition follows by (weak) duality. □ 
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It follows that the algorithm from Section |3H may be used to approximate opt(O), albeit with 
limited accuracy for some choices of O, by taking the reciprocal of the value of the game associated 
with O. To be more specific, let R be the Choi-Jamiolkowski representation of the super-operator 
O as discussed in Section |231 Let us write to denote the approximate value of the game 
described by R that results from the algorithm's e-approximate equilibrium point of R, and let us 
also write opt(<I>) = l/a(0). We then have 

and therefore 

For choices of O for which is large and ||R|| is small (bounded below and above by 
constants, say), a reasonable approximation to opt(O) may be obtained. For many choices of ^, 
however, our method is clearly not suitable, and we believe it is an interesting problem for future 
research to find more accurate parallel algorithms for this problem. 

6 Conclusion 

In this paper we have shown that equilibrium points of non-interactive zero-sum quantum games 
can be efficiently computed in parallel, using the multiplicative weights update method. As a 
consequence, we have that one-turn quantum refereed games can be simulated in polynomial 
space, or QRG(l) C PSPACE. We have also illustrated the connection between values of quantum 
games and positive instances of semidefinite programming problems. 

The main open question that we wish to raise concerns the existence of efficient parallel algo- 
rithms for positive instances of semidefinite programming problems. The class of such problems 
for which our algorithm gives accurate solutions is limited. To what extent can this task be per- 
formed for more general classes? 

Acknowledgements 

The parallel algorithm presented in this work was inspired by an algorithm appearing in an un- 
published joint work, concerning classical games, of Rohit Khandekar and the first author. We 
therefore thank Rohit Khandekar for his implicit and indirect contribution to this work, and for 
allowing us to include it in this paper. Rahul Jain's research is supported by ARO/NSA USA, 
and John Watrous's research is supported by Canada's NSERC and the Canadian Institute for 
Advanced Research. 

References 

[AHKOSa] S. Arora, E. Hazan, and S. Kale. Fast algorithms for approximate semidefinite pro- 
gramming using the multiplicative weights update method. Proceedings of the 46th 
Annual IEEE Symposium on Foundations of Computer Science, pages 339-348, 2005. 

[AHKOSb] S. Arora, E. Hazan, and S. Kale. The multiplicative weights update method: a meta 
algorithm and applications. Manuscript, 2005. 



16 



[AK07] S. Arora and S. Kale. A combinatorial, primal-dual approach to semidefinite pro- 
grams. Proceedings of the Thirty-Ninth Annual ACM Symposium on Theory of Computing, 
pages 227-236, 2007. 

[AmbOl] A. Ambainis. A new protocol and lower bounds for quantum coin flipping. In Proceed- 
ings of the Thirty-Third Annual ACM Symposium on Theory of Computing, pages 134-142, 
2001. 

[Bha97] R. Bhatia. Matrix Analysis. Springer, 1997. 

[Bor77] A. Borodin. On relating time and space to size and depth. SIAM Journal on Computing, 
6:733-744, 1977. 

[Can96] R. Canetti. On BPP and the polynomial-time hierarchy. Information Processing Letters, 
57:237-241, 1996. 

[CHTW04] R. Cleve, R Hoyer, B. Toner, and J. Watrous. Consequences and limits of nonlocal 
strategies. In Proceedings of the 19th Annual IEEE Conference on Computational Complex- 
ity, pages 236-249, 2004. 

[Fan53] K. Fan. Minimax theorems. Proceedings of the National Academy of Sciences, 39-A2-A7, 
1953. 

[FK97] U. Feige and J. Kilian. Making games short. In Proceedings of the Twenty-Ninth Annual 
ACM Symposium on Theory of Computing, pages 506-516, 1997. 

[Gat93] J. von zur Gathen. Parallel linear algebra. In.]. Reii, editor. Synthesis of Parallel Algo- 
rithms, chapter 13. Morgan Kaufmann Publishers, Inc., 1993. 

[Gut05] G. Gutoski. Upper bounds for quantum interactive proofs with competing provers. 

In Proceedings of the 20th Annual IEEE Conference on Computational Complexity, pages 
334r-343, 2005. 

[GW05] G. Gutoski and J. Watrous. Quantum interactive proofs with competing provers. In 
Proceedings of the 22nd Symposium on Theoretical Aspects of Computer Science, volume 
3404 of Lecture Notes in Computer Science, pages 605-616. Springer, 2005. 

[GW07] G. Gutoski and J. Watrous. Toward a general theory of quantum games. In Proceedings 
of the 39th ACM Symposium on Theory of Computing, pages 565-574, 2007. 

[HJ91] R. Horn and C. Johnson. Topics in Matrix Analysis. Cambridge University Press, 1991. 

[Kal07] S. Kale. Efficient Algorithms Using the Multiplicative Weights Update Method. PhD thesis, 
Princeton University, 2007. 

[Kit02] A. Kitaev. Quantum coin-flipping. Presentation at the 6th Workshop on Quantum 
Information Processing (QIP 2003), 2002. 

[KKM'^08] J. Kempe, H. Kobayashi, K. Matsumoto, B. Toner, and T. Vidick. Entangled games are 
hard to approximate. In Proceedings of the 49th Annual IEEE Symposium on Foundations 
of Computer Science, 2008. 



17 



[KKMV08] J. Kempe, H. Kobayashi, K. Matsumoto, and T. Vidick. Using entanglement in quan- 
tum multi-prover interactive proofs. In Proceedings of the 23rd Annual Conference on 
Computational Complexity, 2008. 

[KM03] H. Kobayashi and K. Matsumoto. Quantum multi-prover interactive proof systems 
with limited prior entanglement. Journal of Computer and System Sciences, 66(3), 2003. 

[LN93] M. Luby and N. Nisan. A parallel approximation algorithm for positive linear pro- 
gramming. Proceedings of the 25th Annual ACM Symposium on Theory of Computing, 
pages M8-A57, 1993. 

[Moc04] C. Mochon. Quantum weak coin-flipping with bias of 0.192. In 45th Annual IEEE 
Symposium on Foundations of Computer Science, pages 2-11, 2004. 

[Moc07] C. Mochon. Quantum weak coin flipping with arbitrarily small bias. Available as 
arXiv.org e-print 0711.4114, 2007. 

[RS98] A. Russell and R. Sundaram. Symmetric alternation captures BPP. Computational Com- 
plexity, 7:152-162, 1998. 

[SR02] R. Spekkens and T. Rudolph. Quantum protocol for cheat-sensitive weak coin flipping. 
Physical Review Letters, 89(22):227901, 2002. 

[TRW05] K. Tsuda, G. Ratsch, and M. Warmuth. Matrix exponentiated gradient updates for 
on-line learning and Bregman projection. The Journal of Machine Learning Research, 
6:995-1018, 2005. 

[vN28] J. von Neumann. Zur theorie der gesellschaftsspiele. Mathematische Annalen, 
100(1928):295-320, 1928. 

[YouOl] N. Young. Sequential and parallel algorithms for mixed packing and covering. Pro- 
ceedings of the 42nd IEEE Symposium on Foundations of Computer Science, pages 538-546, 
2001. 



18 



